AITopics | encoder part

Collaborating Authors

encoder part

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

4d7e0d72898ae7ea3593eb5ebf20c744-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-8-2026, 09:06:23 GMT

CFQ is the only realistic benchmark that comprehensively measure compositional generalization.

artificial intelligence, compositional generalization, natural language, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language (0.31)

Add feedback

4d7e0d72898ae7ea3593eb5ebf20c744-AuthorFeedback.pdf

Neural Information Processing SystemsOct-2-2025, 21:17:52 GMT

CFQ is the only realistic benchmark that comprehensively measure compositional generalization.

artificial intelligence, compositional generalization, natural language, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language (0.31)

Add feedback

Satellite imagery segmentation using U-NET

#artificialintelligenceOct-25-2022, 13:20:26 GMT

In this blog, we will conduct picture segmentation on a very limited dataset using U-Net, a popular segmentation CNN model. There will also be some customized loss functions used for training reasons, such as dice loss and Jaccard index metrics. The data that we will be working with comes from kaggle. The dataset is called Semantic segmentation of aerial imagery. The dataset has two sorts of files .jpg

dataset, satellite imagery segmentation, segmentation, (16 more...)

#artificialintelligence

Industry: Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.97)

Add feedback

Text Classification using Transformers

#artificialintelligenceMar-18-2021, 05:55:44 GMT

In this part, we will try to understand the Encoder-Decoder architecture of the Multi-Head Self-Attention Transformer network with some code in PyTorch. There won't be any theory involved(better theoretical version can be found here) just the barebones of the network and how can one write this network on its own in PyTorch. The architecture comprising the Transformer model is divided into two parts -- the Encoder part and the Decoder part. Several other things combine to form the Encoder and Decoder parts. Let's start with the Encoder.

decoder part, encoder part, text classification, (5 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.40)

Add feedback

A Reinforcement Learning Based Encoder-Decoder Framework for Learning Stock Trading Rules

Taghian, Mehran, Asadi, Ahmad, Safabakhsh, Reza

arXiv.org Artificial IntelligenceJan-8-2021

A wide variety of deep reinforcement learning (DRL) models have recently been proposed to learn profitable investment strategies. The rules learned by these models outperform the previous strategies specially in high frequency trading environments. However, it is shown that the quality of the extracted features from a long-term sequence of raw prices of the instruments greatly affects the performance of the trading rules learned by these models. Employing a neural encoder-decoder structure to extract informative features from complex input time-series has proved very effective in other popular tasks like neural machine translation and video captioning in which the models face a similar problem. The encoder-decoder framework extracts highly informative features from a long sequence of prices along with learning how to generate outputs based on the extracted features. In this paper, a novel end-to-end model based on the neural encoder-decoder framework combined with DRL is proposed to learn single instrument trading strategies from a long sequence of raw prices of the instrument. The proposed model consists of an encoder which is a neural structure responsible for learning informative features from the input sequence, and a decoder which is a DRL model responsible for learning profitable strategies based on the features extracted by the encoder. The parameters of the encoder and the decoder structures are learned jointly, which enables the encoder to extract features fitted to the task of the decoder DRL. In addition, the effects of different structures for the encoder and various forms of the input sequences on the performance of the learned strategies are investigated. Experimental results showed that the proposed model outperforms other state-of-the-art models in highly dynamic environments.

candlestick, reinforcement, trading strategy, (14 more...)

arXiv.org Artificial Intelligence

2101.03867

Country: North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report > Promising Solution (0.34)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Understanding Autoencoders with Information Theoretic Concepts

Yu, Shujian, Principe, Jose C.

arXiv.org Machine LearningMar-30-2018

Despite their great success in practical applications, there is still a lack of theoretical and systematic methods to analyze deep neural networks. In this paper, we illustrate an advanced information theoretic methodology to understand the dynamics of learning and the design of autoencoders, a special type of deep learning architectures that resembles a communication channel. By generalizing the information plane to any cost function, and inspecting the roles and dynamics of different layers using layer-wise information quantities, we emphasize the role that mutual information plays in quantifying learning from data. We further propose and also experimentally validate, for mean square error training, two hypotheses regarding the layer-wise flow of information and intrinsic dimensionality of the bottleneck layer, using respectively the data processing inequality and the identification of a bifurcation point in the information plane that is controlled by the given data. Our observations have direct impact on the optimal design of autoencoders, the design of alternative feedforward training methods, and even in the problem of generalization.

artificial intelligence, information, machine learning, (19 more...)

arXiv.org Machine Learning

1804.00057

Country:

North America > United States (0.46)
Europe (0.28)
Asia > China (0.28)

Genre: Research Report (0.50)

Industry:

Education (0.67)
Information Technology > Software (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Autoencoder

#artificialintelligenceJun-20-2016, 08:25:38 GMT

Goal Autoencoder have long been proposed to tackle the problem of unsupervised learning. In this week's summary we have a look at their capabilities of providing a features that can be successfully used in supervised tasks and sketch their framework architecture. Motivation In supervised learning, back in the days, deeper architectures need some kind of pretraining of layers before the actual supervised tasked could be pursued. Autoencoder came in handy for this and allowed to train one layer after the other and were able to find useful features for the supervised learning. Steps Let us start by looking at the general architecture.

artificial intelligence, machine learning, representation, (12 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback